[1]. Zhenwei Shao, Zhou Yu, Meng Wang, Jun Yu, “Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. [CCF A类会议]
[2]. Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu*, “ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. [CCF A类会议]
[3]. Zhou Yu, Zitian Jin, Jun Yu*, Mingliang Xu, Hongbo Wang, Jianping Fan, “Bilaterally slimmable transformer for elastic and efficient visual question answering”, IEEE Transactions on Multimedia, 2023. [SCI 一区期刊]
[4]. Yuhao Cui, Zhou Yu*, Chunqi Wang, Zhongzhou Zhao, Ji Zhang, Meng Wang, Jun Yu, ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge, ACM International Conference on Multimedia (ACM MM), 2021. [CCF A类会议]
[5]. Zhou Yu, Yuhao Cui, Jun Yu*, Meng Wang, Dacheng Tao, Tian Qi, Deep Multimodal Neural Architecture Search, ACM International Conference on Multimedia (ACM MM), 2020. [CCF A类会议]
[6]. Zhou Yu, Jun Yu*, Yuhao Cui, Dacheng Tao, Tian Qi, Deep Modular Co-Attention Networks for Visual Question Answering, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. [CCF A类会议]
[7]. Zhou Yu, Dejing Xu, Jun Yu*, Ting Yu, Zhou Zhao, Yueting Zhuang, Dacheng Tao, ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering, AAAI Conference on Artificial Intelligence (AAAI), 2019. [CCF A类会议]
[8]. Zhou Yu, Jun Yu*, Chenchao Xiang, Jianping Fan, Dacheng Tao, Beyond Bilinear: Generalized Multimodal Factorized High-order Pooling for Visual Question Answering, IEEE Transactions on Neural Networks and Learning Systems (T-NNLS), 29 (12): 5947-5959, 2018. [SCI一区,ESI高被引]
[9]. Zhou Yu, Jun Yu*, Chenchao Xiang, Zhou Zhao, Qi Tian, Dacheng Tao, Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding, International Joint Conference on Artificial Intelligence (IJCAI), 2018. [CCF A类会议]
[10]. Zhou Yu, Jun Yu*, Jianping Fan, Dacheng Tao, Multi-modal Factorized Bilinear Pooling with Coattention Learning for Visual Question Answering, International Conference on Computer Vision (ICCV), 2017. [CCF A类会议]